Modelling Relational Statistics With Bayes Nets (Poster Presentation SRL Workshop)
نویسندگان
چکیده
Class-level dependencies model general relational statistics over attributes of linked objects and links. Class-level relationships are important in themselves, and they support applications like policy making, strategic planning, and query optimization. An example of a class-level query is “what is the percentage of friendship pairs where both friends are women?”. To represent class-level statistics, we utilize Parametrized Bayes nets (PBNs), a 1st-order logic extension of Bayes nets. The standard grounding semantics for PBNs is appropriate for answering queries about specific ground facts but not appropriate for answering queries about classes of individuals. We propose a novel random selection semantics for PBNs, based on Halpern’s classic semantics for probabilistic 1st-order logic (Halpern, 1990), that supports classlevel queries. For parameter learning we use the empirical frequencies in the relational data. A naive computation of the empirical frequencies of the relations is intractable due to the complexity imposed by negated relations. We render the computation tractable by using the Möbius transform. Evaluation on four benchmark datasets indicates that maximum pseudo-likelihood provides accurate estimates at different sample sizes.
منابع مشابه
Challenge Paper : Marginal Probabilities for Instances and Classes ( Poster Presentation SRL Workshop )
In classic AI research on combining logic and probability, Halpern introduced an inference principle for marginal probabilities of ground atoms: If the corresponding population frequency is known, the marginal probability should be equal to it. For instance, if the only thing we know about Tweety is that Tweety is a bird, then the probability that Tweety flies should be the frequency of flyers ...
متن کاملLearning Class-Level Bayes Nets for Relational Data
Many databases store data in relational format, with different types of entities and information about links between the entities. The field of statistical-relational learning (SRL) has developed a number of new statistical models for such data. In this paper we focus on learning class-level or first-order dependencies, which model the general database statistics over attributes of linked objec...
متن کاملView Learning for Statistical Relational Learning: With an Application to Mammography
Statistical relational learning (SRL) constructs probabilistic models from relational databases. A key capability of SRL is the learning of arcs (in the Bayes net sense) connecting entries in different rows of a relational table, or in different tables. Nevertheless, SRL approaches currently are constrained to use the existing database schema. For many database applications, users find it profi...
متن کاملClass-Level Bayes Nets for Relational Data
Many databases store data in relational format, with different types of entities and information about links between the entities. The field of statistical-relational learning has developed a number of new statistical models for such data. Most of these models aim to support instance-level predictions about the attributes or links of specific entities. In this paper we focus on learning class-l...
متن کاملLearning Bayes Nets for Relational Data With Link Uncertainty Extended Abstract
We present an algorithm for learning correlations among link types and node attributes in relational data that represent complex heterogeneous networks. The link correlations are represented in a Bayes net structure. The current state of the art algorithm for learning relational Bayes nets captures only correlations among entity attributes given the existence of links among entities. The models...
متن کامل